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One of the main signatures of primordial non-Gaussianity of the local type is a scale-dependent 
correction to the bias of large-scale structure tracers such as galaxies or clusters, whose amplitude 
depends on the bias of the tracers itself. The dominant source of noise in the power spectrum 
of the tracers is caused by sampling variance on large scales (where the non-Gaussian signal is 
strongest) and shot noise arising from their discrete nature. Recent work has argued that one can 
avoid sampling variance by comparing multiple tracers of different bias, and suppress shot noise by 
optimally weighting halos of different mass. Here we combine these ideas and investigate how well 
the signatures of non-Gaussian fluctuations in the primordial potential can be extracted from the 
two-point correlations of halos and dark matter. On the basis of large A'^-body simulations with 
local non-Gaussian initial conditions and their halo catalogs we perform a Fisher matrix analysis of 
the two-point statistics. Compared to the standard analysis, optimal weighting and multiple-tracer 
techniques applied to halos can yield up to 1 order of magnitude improvements in /NL-constraints, 
even if the underlying dark matter density field is not known. In this case one needs to resolve all 
halos down to lO^''/i~^M0 at z = 0, while with the dark matter this is already achieved at a mass 
threshold of 10'^'^/i~^Mq. We compare our numerical results to the halo model and find satisfactory 
agreement. Forecasting the optimal /NL-constraints that can be achieved with our methods when 
applied to existing and future survey data, we find that a survey of 50/i~^Gpc'^ volume resolving 
all halos down to lO^^/i~^M0 at z — 1 will be able to obtain af^^ ~ 1 (68% cl), a factor of ~ 20 
improvement over the current limits. Decreasing the minimum mass of resolved halos, increasing 
the survey volume or obtaining the dark matter maps can further improve these limits, potentially 
reaching the level of a-f^^ ~ 0.1. This precision opens up the possibility to distinguish different 
types of primordial non-Gaussianity and to probe inflationary physics of the very early Universe. 

PACS numbers: 98.80.-k, 98.62.-g, 98.65.-r 



I. INTRODUCTION 

A detection of primordial non-Gaussianity has the po- 
tential to test today's standard inflationary paradigm 
and its alternatives for the physics of the early Universe. 
Measurements of the CMB bispectrum furnish a direct 
probe of the nature of the initial conditions (see, e.g., 
[l|-l5| and references therein), but are limited by the two- 
dimensional nature of the CMB and its damping on small 
scales. However, the non-Gaussian signatures imprinted 
in the initial fluctuations of the potential gravitation- 
ally evolve into the large-scale structure (LSS) of the 
Universe, which can be observed in all three dimensions 
and whose statistical properties can be constrained with 
galaxy clustering data (for recent reviews, see 0|)- 

One of the cleanest probes is the galaxy (or, more 
generally, any tracer of LSS including clusters, etc.) 
two-point correlation function (in configuration space) 
or power spectrum (in Fourier space), which develops 
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a characteristic scale dependence on large scales in the 
presence of primordial non-Gaussianity of the local type 
The power spectrum picks up an additional term 
proportional to /nl(^g ~ 1)i where 6g is the Gaussian 
bias of the tracer and /nl is a parameter describing the 
strength of the non-Gaussian signal. However, the pre- 
cision to which we can constrain /nl is limited by sam- 
pling variance on large scales: each Fourier mode is an 
independent realization of a (nearly) Gaussian random 
field, so the ability to determine its rms-amplitude from 
a finite number of modes is limited. Recent work has 
demonstrated that it is possible to circumvent sampling 
variance by comparing two different tracers of the same 
underlying density field [§-[ll| . The idea is to take the ra- 
tio of power spectra from two tracers to (at least partly) 
cancel out the random fluctuations, leaving just the sig- 
nature of primordial non-Gaussianity itself. 

Another important limitation arises from the fact that 
galaxies are discrete tracers of the underlying dark matter 
distribution. Therefore, with a finite number of observ- 
able objects, the measurement of their power spectrum 
is affected by shot noise. Assuming galaxies are sampled 
from a Poisson process, this adds a constant contribution 
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to their power spectrum, it is given by the inverse tracer 
number density 1/n. This is particularly important for 
massive tracers such as clusters, since their number den- 
sity is very low. Yet they are strongly biased and there- 
fore very sensitive to a potential non- Gaussian signal. 
Recent work has demonstrated the Poisson shot noise 
model to be inadequate (l3l - [T5| . In particular, [Tsl. 
have shown that a mass-dependent weighting can consid- 
erably suppress the stochasticity between halos and the 
dark matter and thus reduce the shot noise contribution. 
In view of constraining primordial non-Gaussianity from 
LSS, this can be a very helpful tool to further reduce the 
error on /nl- 

Both of these methods (sampling variance cancellation 
and shot noise suppression) have so far been discussed 
separately in the literature. In this paper we combine 
the two to derive optimal constraints on /nl that can 
be achieved from two-point correlations of LSS. We show 
that dramatic improvements are feasible, but we do not 
imply that two-point correlations achieve optimal con- 
straints in general: further gains may be possible when 
considering higher-order correlations, starting with the 
bispectrum analysis fT6|| (three-point correlations). 

This paper is organized as follows: Sec. |TT] briefly re- 
views the impact of local primordial non-Gaussianity on 
the halo bias, and the calculation of the Fisher informa- 
tion content on /nl from two-point statistics in Fourier 
space is presented in Sec. IIIIl In Sec. |IV] we apply our 
weighting and multitracer methods to dark matter halos 
extracted from a series of large cosmological A''-body sim- 
ulations and demonstrate how we can improve the /nl- 
constraints. These results are confronted with the halo 
model predictions in Sec. |V] before we finally summarize 
our findings in Sec. I VII 



II. NON-GAUSSIAN HALO BIAS 

Primordial non-Gaussianity of the local type is usually 
characterized by expanding Bardeen's gauge- invariant 
potential $ about the fiducial Gaussian case. Up to sec- 
ond order, it can be parametrized by the mapping (T7l - [20| 



peak-background split argument to the Gaussian piece of 
Bardeen's potential, one finds a scale-dependent correc- 
tion to the linear halo bias (§, uU, u3 : 



(3) 



&(fc, /nl) = &g + /nl(&g - l)w(fc, z) , 



where 6g is the scale-independent linear bias parameter 
of the corresponding Gaussian field (/nl = 0) and 
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k^T{k)D{z)c^ 
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Here, <5c — 1.686 is the linear critical overdensity for 
spherical collapse. Corrections to Eq. ([3]) beyond linear 
theory have already been worked out and agree reason- 
ably well with numerical simulations j23l - |26| . Also, the 
dependence of the halo bias on merger history and halo 
formation time affects the amplitude of the non-Gaussian 
corrections in Eq. ^ [g^, [27h29| , which we will neglect 
here. 



III. FISHER INFORMATION FROM THE 
TWO-POINT STATISTICS OF LSS 

It is believed that all discrete tracers of LSS, such as 
galaxies and clusters, reside within dark matter halos, 
collapsed nonlinear structures that satisfy the conditions 
for galaxy formation. The analysis of the full complexity 
of LSS is therefore reduced to the information content 
in dark matter halos. In this section we introduce our 
model for the halo covariance matrix and utilize it to 
compute the Fisher information content on /nl from the 
two-point statistics of halos and dark matter in Fourier 
space. We separately consider two cases: first halos only 
and second halos combined with dark matter. While the 
observation of halos is relatively easy with present-day 
galaxy redshift surveys, observing the underlying dark 
matter is hard, but not impossible: weak-lensing tomog- 
raphy is the leading candidate to achieve that. 



Covariance of Halos 



$(x) = $g(x) + /nl*gW 



(1) 



where <I>g(x) is an isotropic Gaussian random field and 
/nl a dimcnsionless phenomenological parameter. Ignor- 
ing smoothing (we will consider scales much larger than 
the Lagrangian size of a halo), the linear density pertur- 
bation Sq is related to $ through the Poisson equation in 
Fourier space. 



, , 2k^T{k)D{zy.^. 



(2) 



where T{k) is the matter transfer function and D{z) is 
the linear growth rate normalized to 1 + z. Applying the 



1. Definitions 

We write the halo overdensity in Fourier space as a 
vector whose elements correspond to TV successive bins 



= (<5hi, 5h2 7 • ■ • : ^hjv)'"' 



(5) 



In this paper we will only consider a binning in halo mass, 
but the following equations remain valid for any quan- 
tity that the halo density field depends on (e.g., galaxy- 
luminosity, etc.). The covariance matrix of halos is de- 
fined as 



(6) 
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i.e., the outer product of the vector of halo fields averaged 
within a fc-shell in Fourier space. Assuming the halos to 
be locally biased and stochastic tracers of the dark matter 
density field S, we can write 



and we define 



(5h = bS + e 



(^2 



(7) 



(8) 



as the effective bias, which is generally scale-dependent 
and non-Gaussian, e is a residual noise-field with zero 
mean and we assume it to be uncorrelated with the dark 
matter, i.e., {eS) = [30| . 

In each mass bin, the effective bias b shows a distinct 
dependence on /nl- In what follows, we will assume that 
b is linear in /nl, as suggested by Eq. ([3]): 



b(fc,/NL) 



/NLb'(fc) . 



(9) 



Here, 6g is the Gaussian effective bias and b' = db/df^i^. 
Finally, we write P = {6'^) for the nonlinear dark matter 
power spectrum and assume dP/dfm, = 0. This is a 
good approximation on large scales |3ll - [33| . Thus, the 
model from Eq. ([7]) yields the following halo covariance 
matrix: 

Ch = bb^P + £ , (10) 
where the shot noise matrix £ was defined as 

£ ^ (eeT) . (11) 

In principle, £ can contain other components than pure 
Poisson noise, for instance higher-order terms from the 
bias expansion [34i-[3&] . Here and henceforth, we will de- 
fine £ as the residual from the effective bias term bb^ P in 
Ch, and allow it to depend on /nl- Thus, with Eqs. ^ 
and (ITOl) the shot noise matrix can be written as 



(12) 



This agrees precisely with the definition given in [3] 
for the Gaussian case, however it also takes into ac- 
count the possibility of a scale-dependent effective bias in 
non-Gaussian scenarios, such that the effective bias term 
bb^P always cancels in this expression [s^]- 

Reference [l3l already investigated the Fisher informa- 
tion content on primordial non-Gaussianity for the ideal- 
ized case of a purely Poissonian shot noise component in 
the halo covariance matrix. In [l5j |. the halo covariance 
was suggested to be of a similar simple form, albeit with a 
modified definition of halo bias and a diagonal shot noise 
matrix. In this work we will consider the more general 
model of Eq. (|10|) without assuming anything about £. 



Instead we will investigate the shot noise matrix with the 
help of A^-body simulations. 

The Gaussian case has already been studied in [14|. 
Simulations revealed a very simple eigenstructure of the 
shot noise matrix: for N > 2 mass bins of equal number 
density n it exhibits a {N — 2)-dimensional degenerate 

subspace with eigenvalue Ap^ = 1/n, which is the 
expected result from Poisson sampling. Of the two re- 
maining eigenvalues A±, one is enhanced (A+) and one 
suppressed (A_) with respect to the value 1/n. The shot 
noise matrix can thus be written as 

£ = n-^I+{X+-n-'^)V+V^ + {X--fi-^)V_Vj , (13) 

where I is the N x N identity matrix and V± are the 
normalized eigenvectors corresponding to X± . Its inverse 
takes a very similar form 



n)V+ Vj + (AI^ - n)V_ Vj . (14) 



The halo model [38| can be applied to predict the func- 
tional form of A± and V± (see [l^ and Sec. |Vl. This 
approach is however not expected to be exact, as it does 
not ensure mass- and momentum conservation of the dark 
matter density field and leads to white-noise-like contri- 
butions in both the halo-matter cross and the matter 
auto power spectra which are not observed in simula- 
tions ^39(|. Yet, the halo model is able to reproduce the 
eigenstructure of £ fairly well [3] and we will use it for 
making predictions beyond our A'^-body resolution limit. 

In the Gaussian case one can also relate the dominant 
eigenmode 1+ with corresponding eigenvalue A+ to the 
second-order term arising in a local bias-expansion model 
34, 35], where the coefficients bi are determined analyt- 
ically from the peak- background split formalism given a 
halo mass function [l^, |4l[. In non- Gaussian scenarios 
this can be extended to a multivariate expansion in dark 
matter density S and primordial potential $ including 
bias coefficients for both fields [l^ HI] ■ For the calcula- 
tion of £ we will however restrict ourselves to the Gaus- 
sian case and later compare with the numerical results 
of non-Gaussian initial conditions to see the effects of 
/nl on £ and its eigenvalues. The suppressed eigenmode 
V- with eigenvalue A_ can also be explained by a halo- 
exclusion correction to the Poisson-sampling model for 
halos, as studied in |3^ . 

In what follows, we will truncate the local bias expan- 
sion at second order. Therefore, we shall assume the 
following model for the halo overdensity in configuration 
space 



^h(x) = 6i(5(x) + 62<5^(x) + np(x) + nc(x) 



(15) 



Here, np is the usual Poisson noise and ric a correction to 
account for deviations from the Poisson-sampling model. 
In Fourier space, this yields 

^h(k) = bi5(k) + b2 (6^6) (k) + np(k) + ne(k) , (16) 
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where the asterisk-symbol denotes a convolution. The 
Poisson noise np arises from a discrete sampling of the 
field with a finite number of halos, it is uncorrelated 
with the underlying dark matter density, (ripS) — 0, 
and its power spectrum is {upTip) — 1/n (Poisson white 
noise). We further assume (npnj) = (ricS) = 0, which 
leads to 



10' 



b = bi+b: 



{{6*6)6) 
^6^ 



(17) 



Ch - b^bl{6^) + {b^bl + b^bl) {{6*6) 6) 



b^bl{{6*6)' 



{npuj,) 



{iS*S)6)^ 



(19) 

Hence, we can identify the normalized vector ^2/162! with 
the eigenvector V+ of Eq. (fTS]) with corresponding eigen- 
value 



K = blb^Ss-^ 



where we define 



{{6*6)^ 



{{6*6) 6y^ 



(20) 



(21) 



In [36| this term is absorbed into an effective shot noise 
power, since it behaves like white noise on large scales and 
arises from the peaks and troughs in the dark matter den- 
sity field being nonlinearly biased by the &2-term [4^. We 
evaluated £g2 along with the expressions that appear in 
Eq. (f2T|) with the help of our dark matter A^-body simu- 
lations for Gaussian and non-Gaussian initial conditions 
(for details about the simulations, see Sec. lIVp. 

The results are depicted in Fig. [TJ £52 obviously shows 
a slight dependence on /nl, but it remains white-noise- 
like even in the non-Gaussian cases. The /NL-dependence 
of this term has not been discussed in the literature yet, 
but it can have a significant impact on the power spec- 
trum of high-mass halos which have a large 62-term; see 
Eq. (PO)) . A discussion of the numerical results for ha- 
los, specifically the /NL-dependence of A+, is conducted 
later in this paper. It is also worth noticing the /NL- 
dependence of {{6*6)'^) and {{5*5)6). The properties of 
the squared dark matter field (5^(x) are similar to the 
ones of halos, namely, the A;~^-correction of the effective 
bias in Fourier space, which in this case is defined as 
bs2 EE {{5*5) 6)/ {6^) and appears in Eq. p71) . 

The last term in Eq. (flQl) corresponds to the suppressed 
eigenmode of the shot noise matrix. Both its eigenvector 
and eigenvalue can be described reasonably well by the 
halo model [l3|- The argument of [SS^ based on halo 
exclusion yields a similar result while providing a more 
intuitive explanation for the occurrence of such a term. 
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FIG. 1. Shot noise £g2 of the squared dark matter density field 
5^ as defined in Eq. (|21|1 with both Gaussian (solid green) and 
non-Gaussian initial conditions with /nl = -1-100 (solid red) 
and /nl = —100 (solid yellow) from iV-body simulations at 
z = 0. Clearly, £g2 is close to white-noise like in all three 
cases. The auto power spectrum {{6*5)'^) of in Fourier 
space (dashed), its cross power spectrum {{S*S)S) with the 
ordinary dark matter field 5 (dot-dashed) , as well as the ordi- 
nary dark matter power spectrum {5^} (dotted) are overplot- 
ted for the corresponding values of /nl- The squared dark 
matter field can be interpreted as a biased tracer of 6 and 
therefore shows the characteristic /NL-dependence of biased 
fields (like halos) on large scales. 



2. Likelihood and Fisher information 

In order to find the best unbiased estimator for /nl, 
we have to maximize the likelihood function. Although 
we are dealing with non-Gaussian statistics of the density 
field, deviations from the Gaussian case are usually small 
in practical applications (e.g., [12, 13, EJ]), so we will 
consider a multivariate Gaussian likelihood 



1 



(27r)^/Vdet Ch 



exp 1 --<5;C-M, ) . (22) 



Maximizing this likelihood function is equivalent to min- 
imizing the following chi-square, 

- SlC^^S^^ + In (1 + a) + ln(det £) , (23) 

where we dropped the irrelevant constant Afln(27r) and 
used 

det Ch = det {bb'' P + £) ^ {I + a) det £ , (24) 

with a = b^£^^bP. For a single mass bin, Eq. 
simplifies to 



SI 



b^P + £ 



\n(b^P + £) 



(25) 
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The Fisher information matrix for the parameters 
9i and 9j and the random variable <5h with covariance Gh, 
as derived from a multivariate Gaussian likelihood [4a, 
lirl. reads 



1. 



-Tr 



9Ch i9Ch 1 
d9, h Q0^ '-h 



(26) 



With the above assumptions, the derivative of the halo 
covariance matrix with respect to the parameter /nl is 

5Ch 



= (bb'^ + b'b^) p + e' , 



(27) 



9/nl 

with S' = i95/i9/nl- The inverse of the covariance ma- 
trix can be obtained by applying the Sherman-Morrison 
formula M, SI 



£-^bb^£~^P 



1 



a 



(28) 



where again a = b^E^bP. On inserting the two previ- 
ous relations into Eq. ([26l) . we eventually obtain the full 
expression for -F/nl/nl in terms of 6, b' , £' and P (see 
Appendix El for the derivation of Eq. (jAllI) '). Neglecting 
the /NL-dependence of £, i.e., setting £' = 0, the Fisher 
information on /nl becomes 



a7 + + a (a7 - P^) 



(29) 



with a = b''£-^bP, 13 = b^£-^b'P and 7 = ^E-^b'P. 
For a single mass bin, Eq. (jAlip simplifies to Eq. (|A12p . 



bb'P + 572 
62p + £ 



(30) 



This implies that even in the limit of a very well-sampled 
halo density field (n — > 00) with negligible shot noise 
power £ (and neglecting £') the Fisher information con- 
tent on /nl that can be extracted per mode from a single 
halo mass bin is limited to the value 2 {b' /hf' . This is due 
to the fact that we can only constrain /nl from a change 
in the halo bias relative to the Gaussian expectation, not 
from a measurement of the effective bias itself. The lat- 
ter can only be measured directly if one knows the dark 
matter distribution, as will be shown in the subsequent 
paragraph. However, the situation changes for several 
halo mass bins (multiple tracers as in [§]). In this case, 
the Fisher information content from Eqs. (jAll[) and (|29p 
can exceed the value 2 (5'/6) (see Sec. llVland IV)) . 



B. Covariance of Halos and Dark Matter 

1. Definitions 

We will now assume that we possess knowledge about 
the dark matter distribution in addition to the halo den- 
sity field. In practice one may be able to achieve this by 



combining galaxy redshift surveys with lensing tomogra- 
phy [H^l, but the prospects are somewhat uncertain. We 
will simply add the dark matter overdensity mode 5 to 
the halo overdensity vector Jh, defining a new vector 



(31) 



In analogy with the previous section, we define the co- 
variance matrix as C = {S5'^) and write 



{S') (SIS) 



P b'^P 
bP Ch 



(32) 



2. Likelihood and Fisher information 

Upon inserting the new covariance matrix into the 
Gaussian likelihood as defined in Eq. (P^ . we find the 
chi-square to be 



X 

where we used 



2 = ^Tc~M +ln(detf:) , 



(33) 



detC = detChdet {P - b'^ C^^bP^) = Pdet5 , (34) 

and we still assume P to be independent of /nl and there- 
fore drop the term In (P) in Eq. ([33| . In terms of the halo 
and dark matter overdensities, the chi-square can also be 
expressed as 

= {K - bSy £-^ (<5h - bS) + In (det £) , (35) 

which is equivalent to the definition in [l^ (where the 
last term was neglected). The corresponding expression 
for a single halo mass bin reads 



For the derivative of C with respect to /nl we get 
dC 



NL 



b'Tp 
b'P b^P + b'b^P + £' 



(36) 



(37) 



Performing a block inversion, we readily obtain the in- 
verse covariance matrix, 



(l + a)p-i W£-^ 



£ 'b 



£-' 



(38) 



As shown in Appendix [BJ the Fisher information content 
on /nl now becomes 



with 7 = b'^£~^b'P and r = ^Tr {£' £-^ £' £-^'' 
single halo mass bin this further simplifies to 

2 



6'2p 

~ ^ 2 



(39) 



For a 



(40) 
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It is worth noting that, in contrast to Eq. (1501) . the Fisher 
information from one halo mass bin with knowledge of 
the dark matter becomes infinite in the limit of vanish- 
ing £ . In this limit the effective bias can indeed be deter- 
mined exactly, allowing an exact measurement of /nl • 



IV. APPLICATION TO N-BODY SIMULATIONS 

We employ numerical A^-body simulations with both 
Gaussian and non-Gaussian initial conditions to find sig- 
natures of primordial non-Gaussianity in the two-point 
statistics of the final density fields in Fourier space. More 
precisely, we consider an ensemble of 12 realizations of 
box-size 1.6/i~^Gpc (this yields a total effective volume of 
Ves ~ 50ft.~'^Gpc'^). Each realization is seeded with both 
Gaussian (/nl = 0) and non-Gaussian (/nl = ±100) ini- 
tial conditions of the local type [3l|, and evolves 1024^^ 
particles of mass 3.0 x 10^^/i~"'^Mq. The cosmological 
parameters are = 0.279, 17a = 0.721, fib = 0.046, 
(Js = 0.81, rts = 0.96, and h — 0.7, consistent with the 
WMAP5 [5l|] best-fit constraint. Additionally, we con- 
sider one realization with each /nl — 0, ±50 of box-size 
l.S/i-^Gpc with 1536^ particles of mass 4.7 x IO^O/i-IMq 
to assess a higher-resolution regime. The simulations 
were performed on the supercomputer ZBOx3 at the Uni- 
versity of Ziirich with the gadget ii code [l^l . The ini- 
tial conditions were laid down at redshift z = 100 by per- 
turbing a uniform mesh of particles with the Zel'dovich 
approximation. 

To generate halo catalogs, we employ a friends-of- 
friends (EOF) algorithm [53] with a linking length equal 
to 20% of the mean interparticle distance. For compar- 
ison, we also generate halo catalogs using the AHF halo 
finder developed by 5J], which is based on the spherical 
overdensity (SO) method [s^. In this case, we assume an 
overdensity threshold Ac(2;) being a decreasing function 
of redshift, as dictated by the solution to the spherical 
collapse of a tophat perturbation in a ACDM Universe 
[56j . In both cases, we require a minimum of 20 parti- 
cles per halo, which corresponds to a minimum halo mass 
Mmin ^ 5.9 X IQ^'^H-'^Mq for the simulations with 1024^ 
particles. For Gaussian initial conditions the resulting to- 



tal number density of halos is n ~ 7. Ox 10 ''/I'^Mpc and 
4.2 X lO-^/i^Mpc"^ for the EOF and SO catalogs, respec- 
tively. Note that the FOE mass estimate is on average 
20% higher than the SO mass estimate. For our 1536'^- 
particles simulation we obtain Mmin — 9.4 x 10^^/i~^Mq 
and fi ~ 4.0 X lO^^ft-^Mpc"^ resulting from the EOF halo 
finder. 

The binning of the halo density field into N consecutive 
mass bins is done by sorting all halos by increasing mass 
and dividing this ordered array into bins with an equal 
number of halos. The halos of each bin i € [1 . . . A^] are 
selected separately to construct the halo density field • 
The density fields of dark matter and halos are first com- 



puted in configuration space via interpolation of the par- 
ticles onto a cubical mesh with 512^ grid points using a 
cloud- in-cell mesh assignment algorithm [53] • We then 
perform a fast fourier transform to compute the modes 
of the fields in fc-space. 

For each of our Gaussian and non-Gaussian realiza- 
tions, we match the total number of halos to the one real- 
ization with the least amount of them by discarding halos 
from the low-mass end. This abundance matching tech- 
nique ensures that we eliminate any possible signature of 
primordial non-Gaussianity induced by the unobservable 
/NL-dependence of the halo mass function. It guarantees 
a constant value 1 /n of the Poisson noise for both Gaus- 
sian and non-Gaussian realizations. A dependence of the 
Poisson noise on /nl would complicate the interpreta- 
tion of the Fisher information content. Note also that, 
in order to calculate the derivative of a function with 
respect to /nl, we apply the linear approximation 



OF J-(/] 



nl 



-100) - F{h 



NL 



-100) 



NL 



2 X 100 



(41) 



which exploits the statistics of all our non-Gaussian runs. 
All the error bars quoted in this paper are computed from 
the variance amongst our 12 realizations. 



A. Effective bias and shot noise 

At the two-point level and in Fourier space, the clus- 
tering of halos as described by Eq. pUj) is determined 
by two basic components: effective bias and shot noise. 
Since the impact of primordial non-Gaussianity on the 
nonlinear dark matter power spectrum P is negligible on 
large scales (see Fig. [1]), the dependence of both b and 
£ on /nl must be known if one wishes to constrain the 
latter. In the following sections, we will examine this 
dependence in our series of A^-body simulations. 



1 . Effective 



In the top left panel of Fig. [21 the effective bias b in 
the fiducial Gaussian case (/nl = 0) is shown for 30 
consecutive EOF halo mass bins as a function of wave 
number. In the large-scale limit fc — >■ 0, the measure- 
ments are consistent with being scale- independent, as in- 
dicated by the dotted lines which show the average of 
6(fc, /nl = 0) over all modes with k < 0.032/iMpc"\ 
denoted 6g- At larger wave numbers, the deviations 
can be attributed to higher-order bias terms, which are 
most important at high mass. Relative to the low-fc av- 
eraged, scale- independent Gaussian bias bo, these cor- 
rections tend to suppress the effective bias at low mass, 
whereas they increase it at the very high-mass end (see 
Eq. (ITT)) '). The right panel of Fig. [2] shows the large-scale 
average be a-s a function of halo mass, as determined 
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FIG. 2. LEFT: Gaussian effective bias (top) and its derivative with respect to /nl (bottom) for the case of 30 mass bins. The 
scale-independent part be is plotted in dotted lines for each bin; it was obtained by averaging all modes with k < 0.032/iMpc~^. 
RIGHT: Large-scale averaged Gaussian effective bias be from the left panel (dotted lines) plotted against mean halo mass. 
The solid line depicts the linear-order bias derived from the peak-background split formalism. All error bars are obtained from 
the variance of our 12 boxes to their mean. Results are shown for FOF halos at z = 0. 



from 30 halo mass bins, each with a number density of 
n ~ 2.3 X 10~^/i^Mpc~^. The soHd line is the linear-order 
bias as derived from the peak-background split formal- 
ism [iol . . We find a good agreement with our iV-body 
data, only at masses below ~ 8 x 10^^/i~^Mq deviations 
appear for halos with less than ^ 30 particles [ssl j. 

The bottom left panel of Fig. [5] depicts the derivative 
of b with respect to /nl for each of the 30 mass bins. 
The behavior is well described by the linear theory pre- 
diction of Eq. ([3]), leading to a fc^^-dependence on large 
scales which is more pronounced for more massive ha- 
los ('for quantitative comparisons with simulations, see 
[31I, m, 113). Thus, the amplitude of this effect grad- 
ually diminishes towards smaller scales and even disap- 
pears around k ^ 0.1/iMpc~^. Note that [3l'| argued for 
an additional non-Gaussian bias correction which follows 
from the /NL-dependence of the mass function. This k- 
indepcndent contribution should in principle be included 
m Eq. dS]). However, as can be seen in the lower left plot, 
it is negligible in our approach (i.e., all curves approach 
zero at high fc) owing to the matching of halo abundances 
between our Gaussian and non-Gaussian realizations. 



2. Shot noise matrix 

The shot noise matrix £ has been studied using simu- 
lations with Gaussian initial conditions in [TJ] ■ Figure [3] 
displays the eigenstructure of this matrix for /nl = 
(solid curves) and /nl = ±100 (dashed and dotted 
curves). The left panel depicts all the eigenvalues (top) 
and their derivatives with respect to /nl (bottom) , while 



the right panel shows the two important eigenvectors V+ 
and V- (top) along with their derivatives (bottom) . The 
eigenstructure of £ is accurately described by Eq. 
even in the non-Gaussian case. Namely, we still find one 
enhanced eigenvalue A4. and one suppressed eigenvalue 

A_. The remaining N — 2 eigenvalues Ap^ are degen- 
erate with the value l/n, the Poisson noise expectation. 
This means that our Gaussian bias-expansion model from 
Eq. still works to describe £ in the weakly non- 

Gaussian regime. 

Note however that, owing to sampling variance, the de- 
composition into eigenmodes becomes increasingly noisy 
towards larger scales. This leads to an artificial breaking 
of the eigenvalue degeneracy which manifests itself as a 
scatter around the mean value 1/n. This scatter is the 
major contribution of sampling variance in the halo co- 
variance matrix Ch. Although we can eliminate most of 
it by setting Ap^"^^ = 1/n, a residual degree of sampling 
variance will remain in A4- and A_ , as well as in b and P. 

As is apparent from the left panel in Fig. [31 the domi- 
nant eigenvalue A+ exhibits a small, but noticeable /NL- 
dependence similar to that of £52 in Fig.[Tl which is about 
2% in this case. Its derivative, dX+/dfj<si,, clearly domi- 
nates the derivative of all other eigenvalues (which are all 
consistent with zero due to matched abundances). Only 
the derivative of the suppressed eigenvalue A_ shows a 
similar /NL-dependence of ^ 2%, albeit at a much lower 
absolute amplitude. To check the convergence of our re- 
sults, we repeated the analysis with 100 and 200 bins 
and found both derivatives of A+ and A_ to increase, 
supporting an /NL-dependence of these eigenvalues. 

By contrast, the eigenvectors V+ and V shown in 
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FIG. 3. Eigenvalues (left panel) and eigenvectors (right panel) of the shot noise matrix £ for /nl = (solid), +100 (dashed) and 
— 100 (dotted) in the case of 30 mass bins. Their derivatives with respect to /nl are plotted underneath. For clarity, only the 
two eigenvectors V± along with their derivatives are shown in the right panel. The straight dotted line in the upper left panel 
depicts the value 1/n and the red (dot-dashed) curve in the top right panel shows 62 (M) computed from the peak-background 
split formalism, scaled to the value of V+ at M ~ 3 x lO^''/i~^M0. Results are shown for FOF halos at z = 0. 



the right panel of Fig. [3] exhibit very little dependence 
on /nl (the different lines are all on top of each other). 
The derivatives of V+ and VL with respect to /nl shown 
in the lower panel reveal a very weak sensitivity to /nl 
which is less than 0.5% for most of the mass bins (for 
the most massive bin it reaches up to 1%). We repeated 
the same analysis with 100 and 200 mass bins and found 
that the relative differences between the measurements in 
Gaussian and non-Gaussian simulations further decrease. 
We thus conclude that the eigenvectors and V_ can 
be assumed independent of /nl to a very high accuracy. 

Our findings demonstrate that the two-point statis- 
tics of halos are sensitive to primordial non-Gaussianity 
beyond the linear-order effect of Eq. ^ derived in 
[IjIMlllll- However, the corrections are tiny if one con- 
siders a single bin containing many halos of very different 
mass (see ^37] ) due to mutual cancellations from 62-terms 
of opposite sign. Only two specific eigenmodes of the shot 
noise matrix (corresponding to two different weightings of 
the halo density field) inherit a significant dependence on 
/nl- This is most prominently the case for the eigenmode 
corresponding to the highest eigenvalue A+ . Its eigenvec- 
tor, V^, is shown to be closely related to the second-order 
bias 62 in Eq. (fTO|) . As can be seen in the upper right 
panel of Fig. [31 V+ measured from the simulations, and 
the function 62 (M) calculated from the peak-background 
split formalism [lo, , agree closely (note that 62 {M) 
has been rescaled to match the normalized vector V+). 

In the continuous limit this implies that weighting the 
halo density field with 62 (-^^ ) selects the eigenmode with 
eigenvalue A+ given in Eq. (l20l) . Since A+ depends on /nl 
through the quantity Egi defined in Eq. (pij) . the result- 



ing weighted field will show the same /NL-dependence. 
However, this /NL-depcndcnce cannot immediately be ex- 
ploited to constrain primordial non-Gaussianity, because 
the Fourier modes of f 52 are heavily correlated due to 
the convolution of (5 with itself in Eq. (HH, and thus do 
not contribute to the Fisher information independently. 
The bottom line is that for increasingly massive halo bins 
with large 62 , the term £^1 makes an important contribu- 
tion to the halo power spectrum and shows a significant 
dependence on /nl- It is important to take into account 
this dependence when attempting to extract the best-fit 
value of /nl from high-mass clusters, so as to avoid a 
possible measurement bias. Although it provides some 
additional information on /nl, we will ignore it in the 
following and quote only lower limits on the Fisher infor- 
mation content. 



B. Constraints from Halos and Dark Matter 

Let us first assume the underlying dark matter density 
field 8 is available in addition to the galaxy distribution. 
Although this can in principle be achieved with weak- 
lensing surveys using tomography, the spatial resolution 
will not be comparable to that of galaxy surveys. To 
mimic the observed galaxy distribution we will assume 
that each dark matter halo (identified in the numerical 
simulations) hosts exactly one galaxy. A further refine- 
ment in the description of galaxies can be accomplished 
with the specification of a halo occupation distribution 
for galaxies p^ . [60j . but we will not pursue this here. 
Instead, we can think of the halo catalogs as a sample 
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of central halo galaxies from which satellites have been 
removed. We also neglect the effects of baryons on the 
evolution of structure formation, which are shown to be 
marginally influenced by primordial non-Gaussianity at 
late times 



61| 



1. Single tracer: uniform weighting 

In the simplest scenario we only consider one sin- 
gle halo mass bin. In this case, all the observed halos 
(galaxies) of a survey are correlated with the underly- 
ing dark matter density field in Fourier space to deter- 
mine their scale-dependent effective bias, which can then 
be compared to theoretical predictions. In practice, this 
translates into fitting our theoretical model for the scale- 
dependent effective bias, Eq. ([3]), to the Fourier modes of 
the density fields and extracting the best fitting value of 
/nl together with its uncertainty. For a single halo mass 
bin, we can employ Eq. (j36p and sum over all the Fourier 
modes. 

In the Gaussian simulations, we measure the scale- 
independent effective bias 6g via the estimator {5\^6) / (S^) 
and the shot noise £ via ((5h — bQ6)'^), and average over 
all modes with k < 0.032ft,Mpc^^. In practice, &g and 
£ are not directly observable, but a theoretical predic- 
tion based on the peak-background split [13, El and the 
halo model [l^ provides a reasonable approximation to 
the measured &g and £, respectively, (see Sec. [V]). Note 
that for bins covering a wide range of halo masses, the 
/NL-dependence of the shot noise is negligible [131 and it 
is well approximated by its Gaussian expectation. 

Figure S] shows the best fits of Eq. (jS]) to the simu- 
lations with /nl = 0, ±100 using all the halos of our 
FOE (left panel) and SO catalogs (right panel). In order 
to highlight the relative influence of /nl on the effec- 
tive bias, we normalize the measurements by the large- 
scale Gaussian average be and subtract unity. The re- 
sulting best-flt values of /nl along with their one-sigma 
errors are quoted in the lower right for each case of initial 
conditions. The 68%-confidence region is determined by 
the condition Ax^(/nl) — 1- Note that we include only 
Fourier modes up to fc ~ 0.032/iMpc~^ in the fit, as linear 
theory begins to break down at higher wave numbers. 

Obviously, the best-fit values for /nl measured from 
the EOF halo catalogs are about 20% below the input 
values. A suppression of the non- Gaussian correction to 
the bias of FOE halos has already been reported by [s^ 
[sot . These authors showed that the replacement — 5> qSc 
with q — 0.75 in Eq. ^ yields a good agreement with 
their simulation data. In our framework, including this 
"g-factor" is equivalent to exchanging /nl — >■ /nl/? and 
'^/nl ^ '^/nl/'?' owing to the linear scaling of Eq. ^ 
with Sc ■ Repeating the chi-square minimization with q = 
0.75 yields best-fit values that are consistent with our 
input values, namely /nl = +107.0 ±8.3, +1.8 ±8.7 and 



— 104.0 ± 8.5. In fact, the closest match to the input 
/NL-values is obtained for a slightly larger 9 of ~ 0.8. 

Note that [1^ attributed this suppression to ellipsoidal 
collapse. However, this conclusion seems rather unlikely 
since ellipsoidal collapse increases the collapse threshold 
or, equivalently, implies q > 1 (6^ . A more sensible ex- 
planation arises from the fact that a linking length of 0.2 
times the mean interparticle distance can select regions 
with an overdensity as low as A ^ 1/0.2'^ = 125 (with 
respect to the mean background density Pm), which is 
much less than the virial overdensity Ac(z = 0) 340 
associated with a linear overdensity 5c (see [sgI [gsI W^). 
Therefore, we may reasonably expect that, on average, 
FOE halos with this linking length trace linear overden- 
sities of height less than Sc ■ 

In the case of SO halos, however, we observe the oppo- 
site trend. As is apparent in the right panel of Fig. 21 the 
model from Eq. ^ overestimates the amplitude of pri- 
mordial non-Gaussianity by roughly 40%. This is some- 
what surprising since the overdensity threshold Ac ~ 340 
used to identify the SO halos at z = is precisely the 
virial overdensity predicted by the spherical collapse of a 
linear perturbation of height dc- As we will see shortly, 
however, an optimal weighting of halos can remove this 
overshoot and therefore noticeably improve the agree- 
ment between model and simulations. 



2. Single tracer: optimal weighting 



As demonstrated in [lj|, the shot noise matrix £ ex- 
hibits nonzero off-diagonal elements from correlations be- 
tween halos of different mass. Thus, in order to extract 
the full information on halo statistics, it is necessary to 
include these correlations into our analysis. For this pur- 
pose, we must employ the more general chi-square of 
Eq. ([55]). The halo density field is split up into N con- 
secutive mass bins in order to construct the vector ^h, 
and the full shot noise matrix £ must be considered. 

However, this approach can be simplified, since we 
know that £ exhibits one particularly low eigenvalue A_ . 
Because the Fisher information content on /nl from 
Eq. is proportional to the inverse of £ (this is true 
at least for the dominant part 7), it is governed by the 
eigenmode corresponding to this eigenvalue. In it has 
been shown that this eigenmode dominates the clustering 
signal-to-noise ratio. In the continuous limit (infinitely 
many bins), it can be projected out by performing an 
appropriate weighting of the halo density field. The cor- 
responding weighting function, denoted as modified mass 
weighting with functional form 



w(M) = M + Mo , 



(42) 



was found to minimize the stochasticity of halos with 
respect to the dark matter. Here, M is the individual 
halo mass and Mq a constant whose value depends on 
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FIG. 4. Relative scale dependence of the effective bias from all FOF (left panel) and SO halos (right panel) resolved in our 
N-body simulations (Mmin — 5.9 x lO^^/i~^M0), which are seeded with non-Gaussian initial conditions of the local type with 
/nl = -1-100,0,-100 (solid lines and data points from top to bottom). The solid lines show the best fit to the linear theory 
model of Eq. taking into account all the modes to the left of the arrow. The corresponding best-fit values are quoted in the 
bottom right of each panel. The dotted lines show the model evaluated at the input values /nl ~ -1-100,0, —100. The results 
assume knowledge of the dark matter density field and an effective volume of VoH — 50/i~^Gpc^ at z — 0. 
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FIG. 5. Same as Fig. |4l but for weighted halos that have minimum stochasticity relative to the dark matter. Note that the 
one-sigma errors on /nl are reduced by a factor of ~ 5 compared to uniform weighting. In the case of SO halos the input values 
for /nl are well recovered by the best-fit, while FOF halos still show a suppression of ~ 20% (g ~ 0.8) in the best-fit /nl. 



the resolution of the simulation. It is approximately 3 
times the minimum resolved halo mass Mmin, so in this 
case Mo ~ 1.8 x IO^/i-^Mq. The weighted halo density 
field is computed as 

where we have combined the weights of the individual 
mass bins into a vector w in the last expression. Be- 
cause the chi-square in Eq. psp is dominated by only one 



eigenmode, it simplifies to the form of Eq. ([55)1 with the 
halo field (5h being replaced by the weighted halo field 5u] ■ 
Note also that bo and £ have to be replaced by the cor- 
responding weighted quantities (see |14|). 

The results are shown in Fig. [5] for both FOF and SO 
halos. We observe a remarkable reduction in the error 
on /nl by a factor of '-^ 4 — 6 (depending on the halo 
finder) when replacing the uniform sample used in Fig. 2] 
by the optimally weighted one. While for the FOF halos 
the predicted amplitude of the non-Gaussian correction 
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to the halo bias stih shows the 20% suppression (again, 
this can be taken into account by introducing a g-factor 
into our fit), for the SO halos the best- fit values of /nl 
now agree much better with the input values, i.e., q ~ 1. 

Therefore, the large discrepancy seen in Fig. 2] presum- 
ably arises from noise in the SO mass assignment at low 
mass. To ascertain whether this is the case, we repeat 
the analysis, increasing the threshold for the minimum 
number of particles per halo and discarding all halos be- 
low that threshold. If this threshold reaches 40 particles 
per halo, we find the best-fit /nl to be much closer to the 
input values, namely /nl — +102.6 ±4.6, +2.1 ±4.5 and 
— 103.5 ± 4.4. This suggests that most of the discrep- 
ancy seen in the right panel of Fig;_H] is due to poorly 
resolved halos of mass M < 2Mi„in [Sal ■ However, modi- 
fied mass weighting removes this discrepancy since halos 
at low mass are given less weight. 

Our findings are consistent with the ones of [sll , where 
the non- Gaussian bias of SO halos has been measured 
also at hi ghe r redshifts and mass thresholds, and the 
results of [6a|, where the fractional deviation from the 
Gaussian mass function for both EOF and SO halos was 
presented (see their Fig. 5). The remarkable improve- 
ment in the constraints on /nl follows from the fact that 
the stochasticity (shot noise) of the optimally weighted 
halo density field is strongly suppressed with respect to 
the dark matter [31 ■ This means that the fluctuations 
of the halo and the dark matter overdensity fields are 
more tightly correlated and the variance of the estima- 
tor ((5h<5)/((5^) for the effective bias is minimized. Also, 
cosmic variance fiuctuations inherent in both S and 
are canceled in this ratio (see Appendix [C|) . Since the 
scale dependence of this estimator is a direct probe of pri- 
mordial non-Gaussianity, the error on /nl is significantly 
reduced. At the same time, modified mass weighting in- 
creases the magnitude of be- We will show below that 
the constraints on /nl are indeed optimized with this 
approach. 

Finally, we can test our assumption about the likeli- 
hood function as defined in Eq. ([^ being of a Gaus- 
sian form and thus yielding the correct Fisher informa- 
tion. Non-Gaussian corrections could arise from corre- 
lated fc-modes in the covariance matrix (as present in 
the eigenmode A+ of the shot noise matrix), preventing 
the Fisher information from being a single integral over 
k. The error ab on the effective bias in Figs. |4] and [5] 
is determined from the variance amongst our sample of 
12 realizations and thus provides an independent way of 
testing the value for af^,^: from Eq. ^ we can deter- 
mine CT/pjL — ff,/(6G — ^)u{k,z) and compare it to the 
value obtained from the chi-square fit with Eq. p5p . Ap- 
plying the two methods, we find no significant differences 
in (T/j^L 1 so at least up to the second moment of the likeli- 
hood function, the assumption of it being Gaussian seems 
reasonable for the considered values of /nl- 



3. Multiple tracers 

Let us now estimate the minimal error on /nl achiev- 
able with a given galaxy survey for the general case, di- 
viding halos into multiple mass bins. The Fisher informa- 
tion is given by Eq. (P^)) or (I^Hl) . depending on whether 
the dark matter density field is known or not, and the 
minimal error on /nl is determined via integration over 
all observed modes in the volume V, 

<^]L = ^ ^/N./N. (fc) fc'dfc . (44) 

The largest modes with wave number /cmin = 27r/_Lbox — 
0.0039ft.Mpc"^ available from our iV-body simulations 
are smaller than the largest modes in a survey of 
50/i"^Gpc^ volume (/cmin ^ 0.0017ft.Mpc"^ ), since 
we only obtain an effective volume by considering 12 
smaller simulation boxes. Because the signal from /nl 
is strongest at low fc, our results slightly underestimate 
the total Fisher information. However, we can roughly 
estimate that on larger scales (fcmin < 0.0039/iMpc~^), 
Ff^UNAk) - u^{k)P{k) - A:-4fc"» [see Eqs. Q, 1^ 
and ([551) . as well as Figs. M and H], and thus ct/^l 
In (/cniax/^min) assuming ~ 1, a relatively weak 
dependence on /cmin- In our case this amounts to an 
overestimation of ct/^l by roughly 20%. 

Note that we only consider the /NL-/NL-element of 
the Fisher matrix. In principle we would have to con- 
sider various other parameters of our cosmology and 
then marginalize over them, i.e., compute (F~^) ^ , 

[6(a] . However, any degeneracy with cosmological param- 
eters is largely eliminated when multiple tracers are con- 
sidered, since the underlying dark matter density field 
mostly cancels out in this approach A mathematical 
demonstration of this fact is presented in Appendix [C] 

Recent studies have developed a gauge-invariant de- 
scription of the observable large-scale power spectrum 
consistent with general relativity |67H73l |. In particular, 
it has been noted that the general relativistic corrections 
to the usually adopted Newtonian treatment leave a sig- 
nature in the galaxy power spectrum that is very similar 
to the one caused by primordial non-Gaussianity of the 
local type [7^ - 176} . However, in a multitracer analysis 
the two effects can be distinguished sufficiently well, so 
that the ability to detect primordial non-Gaussianity is 
little compromised in the presence of general relativistic 
corrections [77| . 

In order to make the most conservative estimates we 
will discard all the terms featuring £ in the Fisher ma- 
trix, since it is not obvious how much information on 
/nl can actually be extracted from the shot noise ma- 
trix. £ is indeed close to a pure white-noise quantity and 
we find its Fourier modes to be highly correlated. There- 
fore, in order to extract residual information on /nl, one 
would have to decorrelate those modes through an in- 
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FIG. 6. Fisher information (left panel) and one-sigma error on /nl (right panel, fc^in = 0.0039/iMpc-\ V^a ~ 50/i-^Gpc=') 
from simulations of FOF halos and dark matter at 2 = 0. The lines show the results for 1 (solid black), 3 (dotted blue), 10 
(dashed green) and 30 (dot-dashed red) uniform halo mass bins, as well as for 1 weighted bin (long-dashed yellow). 



version of the correlation matrix among fc-bins (see j78|). 
However, in light of the limited volume of our simulations 
this can be a fairly noisy procedure, especially when the 
halo distribution is additionally split into narrow mass 
bins. Hence, for the Fisher information content on /nl 
assuming knowledge of both halos and dark matter, we 
will retain only the first term in Eq. (j39p and provide a 
lower limit: 

Ff^U.. > 7 ^ b'^e-'b'P . (45) 

To calculate -F/nl/nlj measure the functions b(fc), 
b'{k), £{k) and P{k) from our A^-body simulations (see 
Figs. [TJ [5] and [3]). In order to mitigate sampling variance 
in the multibin case, we then use Eq. to recalculate 
the shot noise matrix. Namely, we set all the eigenvalues 
Ap^ equal to the average value l/fi, and measure A+ , 
A_, as well as V+ and VL directly from the numerical 
eigendecomposition of £. 

Figured depicts Ff^-^f^-^{k) and cr/NL(fcmax) with fixed 
fc„,i„ = 0.0039/iMpc"^ for the cases of 1, 3, 10 and 30 halo 
mass bins. Clearly, the finer the sampling into mass bins, 
the higher the information content on /nl- The weighted 
halo density field with minimal stochasticity relative to 
the dark matter (corresponding to a continuous sampling 
of infinitely many bins) yields more than a factor of 6 re- 
duction in (T/^L when compared to a single mass bin of 
uniformly weighted halos. This improvement agrees rea- 
sonably well with that seen in Figs. |3] and O although 
the estimates for ct/^l slightly larger than those we 
obtained from the fitting procedure. This may be ex- 
pected, since we only obtain an upper limit on ct/nl fi^oni 
Eqs. dH]) and P5|) . 

The infiection around k ^ 0.1/iMpc^^ in -F/nl/nl 
and (J/pjL niarks a breakdown of the linear model from 



Eq. ([9|). We should not trust our results too much at 
high wave number, where higher-order contributions to 
the non-Gaussian effective bias may become important. 
It should also be noted that the inflection disappears for 
the weighted field, suggesting numerical issues to be less 
problematic in that case. 

Further improvements can be achieved when going to 
lower halo masses (see Sec.|V|: the error on /nl is propor- 
tional to the shot noise of the halo density field (Eq. (001)), 
which itself is a function of the minimum halo mass Mmin- 
References [l3, [3 numerically investigated the extent to 
which the shot noise depends on Mmin and proposed a 
method based on the halo model for extrapolating it to 
lower mass. It predicts the shot noise of the weighted 
halo density field to decrease linearly with Mj^in, antici- 
pating about 2 orders of magnitude further reduction in 
£ when resolving halos down to Mmin — lO^^ft-^^M©. In 
terms of /NL-constraints this is however somewhat mit- 
igated by the fact that the Gaussian bias also decreases 
with Mmin, so the non-Gaussian correction to the effec- 
tive bias in Eq. ([3]) gets smaller. Furthermore, [pj stud- 
ied the effect of adding random noise to the halo mass 
(to mimic scatter between halo mass and the observables 
such as galaxy luminosity), while [l^ explored the red- 
shift dependence of the optimally weighted halo density 
field and extended the method to halo occupation distri- 
butions for galaxies. 



C. Constraints from Halos 

The scenario described above is optimistic in the sense 
that it assumes the dark matter density field is available. 
In the following section we will show that it is possible to 
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FIG. 7. Relative scale dependence of the effective bias b estimated from all uniform (left panel) and weighted FOF halos (right 
panel) resolved in our N-body simulations (Mmin — 5.9 x W^^ H'^Mq), which are seeded with non-Gaussian initial conditions of 
the local type with /nl ~ -f 100, 0, —100 (solid lines and data points from top to bottom). The solid lines show the best fit to the 
linear theory model of Eq. Q, taking into account all the modes to the left of the arrow. The corresponding best-fit values are 
quoted in the bottom right of each panel. The dotted lines show the model evaluated at the input values /nl = +100, 0, —100. 
The results assume no knowledge of the dark matter density field and an effective volume of Vcb — 50/i~'^Gpc'^ at 2: = 0. 



considerably improve the constraints on /nl even with- 
out this assumption. This is perhaps not surprising in 
hght of the results in [l3, [iBl : where it was argued that 
halos can be used to reconstruct the dark matter to arbi- 
trary precision, as long as they are resolved down to the 
required low-mass threshold. 

1. Single tracer 

Considering a single halo mass bin, we must again sum 
over all the Fourier modes in Eq. (|25p and minimize this 
chi-square with respect to /nl- Although we pretend to 
have no knowledge of the dark matter distribution, we 
determine &g and £ from our simulations. In realistic 
applications, however, these quantities will have to be 
accurately modeled. In addition, we use the linear power 
spectrum Po{k) instead of the simulated nonlinear dark 
matter power spectrum P(fc) in Eq. (|25|) . 

Since, in this case, we cannot determine the scale- 
dependent effective bias directly from the estimator 
((5h<5)/((5^), we define the new estimator 

which solely depends on the two-point statistics of halos. 
In Fig. [7] we plot this estimator together with the best-fit 
solutions for the scale-dependent effective bias obtained 
from the chi-square fit of Eq. (|25|) . The left panel depicts 
the results obtained for uniform FOF halos. Compared 



to the previous case with dark matter, we observe the 
constraints on /nl to be weaker by a factor of ^ 3. The 
main reason for this difference is the fact that sampling 
variance inherent in dh is not canceled out by subtract- 
ing 6, as is done in Eq. (|36|) . This can also be seen in the 
estimator b, where a division of the smooth linear power 
spectrum Pq does not cancel the cosmic variance inher- 
ent in ((5^). Hence, b shows significantly stronger fluctu- 
ations than b = (ShS) / which demonstrates how well 
the basic idea of sampling variance cancellation works. 



Exchanging the uniform halo field with the weighted 
one, 5u], the constraints on /nl improve by about a factor 
of 2 — 3, as can be seen in the right panel of Fig. [71 
However, this improvement is mainly due to the larger 
value of 6g of the weighted sample, since the relative 
scatter among the data points remains unchanged. This 
is expected, because we do not consider a second tracer 
(e.g., the dark matter) in this case, and therefore do not 
cancel cosmic variance. 



Comparing the uncertainty on /nl obtained from 
Eq. ((23)) with the one determined via the variance of b 
amongst our 12 realizations, we can check once more the 
assumption of a Gaussian likelihood as given in Eq. ([22]) . 
Again, we find both methods to yield consistent values 
for cr/„LJ suggesting any non-Gaussian corrections to the 
likelihood function to be negligible at this order. 
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FIG. 8. Fisher information (left panel) and one-sigma error on /nl (right panel, fc^in = 0.0039/iMpc-\ V^g ~ SOh'^Gpc^) 
from simulations of FOF halos only (^ = 0). The lines show the results for 1 (solid black), 3 (dotted blue), 10 (dashed green) 
and 30 (dot-dashed red) uniform halo mass bins, as well as for 1 weighted bin (long-dashed yellow). 



2. Multiple tracers 

If we want to exploit the gains from sampling variance 
cancellation in the case where the dark matter density 
field is not available, we have to perform a multitracer 
analysis of halos (see Appendix [C]), which is the focus 
of this section. We now consider Eq. ((23|) for the chi- 
square fit. In order to calculate the Fisher information, 
we use Eq. and thus neglect any possible contribu- 
tion emerging from the /NL-dependence of the shot noise 
matrix £. 

Numerical results for the Fisher information content 
and the one-sigma error on /nl are shown in Fig. [8] for 
1, 3, 10 and 30 uniform FOF halo mass bins, as well 
as for 1 weighted bin. Clearly, the cases of 10 and 30 
uniform bins outperform a single bin of the weighted field 
in terms of Fisher information. This suggests that further 
improvements compared to the single weighted halo field 
can be achieved when all the correlations of sufficiently 
many halo mass bins are taken into account. 

In principle, we want to split the halo density field into 
as many mass bins as possible and extrapolate the results 
to the limit of infinitely many bins (continuous limit). 
Note that in the high-sampling limit of n — > (X), ^/nl/nl 
from Eq. dSO]) is hmited to 2{b'/b) , whereas the same 
quantity for several mass bins, Eq. (1291) . may surpass this 
bound (see SecIV]). In Sec. lIVBl wc showed that a single 
optimally weighted halo sample combined with the dark 



matter reaches the continuous limit in F: 



/nl/n 



which 



corresponds to a splitting into infinitely many bins in 
the multitracer approach. It is unclear, whether a similar 
goal can be achieved from halos alone, e.g., by considering 
two differently weighted tracers that would preserve all 



of the information on /nl, because we do not know the 
continuous limit of the Fisher information in that case. 
We will therefore turn to theoretical predictions by the 
halo model in the following paragraph. 



V. HALO MODEL PREDICTIONS 

A useful theoretical framework for the description of 
dark matter and halo clustering is given by the halo 
model (see, e.g., [111). Despite its limitations [11], the 
halo model achieves remarkabl e ag reement with the re- 
sults from iV-body simulations (l4l [38|. In particular, it 
provides an analytical expression for the shot noise ma- 
trix in the fiducial Gaussian case, given by 



(47) 



where Ai = M/p^-i ~ b(nM^)/2p^j and M is a vector 
containing the mean halo mass of each bin (see [3] for 
the derivation). The Poisson model is recovered when we 
set = 0. Here, b can be determined by integrating the 
peak-background split bias b(M) over the Sheth-Tormen 
halo mass function dn/dM [40] in each mass bin. The 
expression {nM'^)/p'^ originates from the dark matter 
one-halo term, so it does not depend on halo mass and 
from our suite of simulations we determine its Gaussian 
value to be ~ 418/i-3Mpc^ at z = and ~ Abh-'^Mpe' 
at z = 1. In the case of one single mass bin, Eq. (|T7)) 
reduces to £ = n"^ - 2bM/p^ + b'^ {nM"^) / p"^, while if 
we project out the lowest eigenmode VL and normalize, 
we obtain the weighted shot noise 



_ Vj£V_ _ V2V_ 



{Vlt) 



(48) 



15 





10 



10 



10'^ 10^3 10^* 



10' 



10' 



FIG. 9. Halo model predictions for the mean scale-independent Gaussian bias (left panel) and shot noise (right panel) as a 
function of minimum halo mass from uniform- (solid lines) and weighted halos (dashed lines) from a single mass bin at 2: = 
(blue) and z — 1 (red). A'^-body simulation results are overplotted, respectively, as squares and circles for different low-mass 
cuts. The dotted line in the left panel depicts be = 1, the ones in the right panel show the Poisson-model shot noise n^J.. 



The eigenvalues X± with eigenvectors V± can be found 
from Eq. (H71) . 



A4 



M'^b±VM'^M b^b , 



V± = N±'^ i^M I ^JM^M. Tb I Vh^ 
where 



(49) 
(50) 

(51) 

is a normahzation constant to guarantee V±V± — I- It is 
easily verified that VjV^ = 0, i.e., they are orthogonal. 
In the continuous limit of infinitely many bins {N — )■ 00) 
we can replace V± by the smooth function 



A4 = \ 2T'2M'^b /VM\Mb^ 



V± =Af±-' [M /^/{M^Tb 
and obtain 

= (ntoi -{Mb)- ^/WW 

h 




(V-b) 



{V- 



(52) 

(53) 
(54) 



where b^, is the weighted effective bias and we exchanged 
the vector products by integrals over the mass function: 

xTy^^ {M)x{M)y{M) dM = N{xy) , 

"-tot Jj\/„i„ dM 

(55) 



"tot 



dn 
'dM 



{M) dM = Nn . 



(56) 



Figure [9] depicts the halo model prediction for the 
scale-independent Gaussian bias &g and shot noise f as a 
function of minimum halo mass Afmin at z = and z = 1 
for both the uniform and the weighted case of a single 
mass bin. Simulation results are overplotted as symbols 
for a few Mmin [we approximate the weighting function 
V-{M) by w{M) from Eq. (O in the simulations]. Ob- 
viously, modified mass weighting increases 6g, especially 
when going to lower halo masses. It is also worth noticing 
that in contrast to the uniform case, 6g is always greater 
than unity when weighted by w{AI) (at least in the con- 
sidered mass range). Going to higher redshift further 
increases 6g at any given Mmin- 

For the shot noise we observe the opposite behav- 
ior: modified mass weighting leads to a suppression 
of f , which is increasingly pronounced towards lower halo 
masses. Moreover, it is always below the Poisson-model 
prediction of nj^j:. Our A'^-body simulation results gen- 
erally confirm this trend (at least down to our resolu- 
tion limit), although the halo model slightly underesti- 
mates the suppression of shot noise between uniform and 
weighted halos at lower Mmin. At higher redshifts, this 
suppression becomes smaller at given Mmin, but the mag- 
nitude of at z = 1 approaches the one at 2 = towards 
low Afniin and is still small compared to the Poisson- 
model prediction of n^^ot- 



A. Single tracer 

With predictions for 6g and £ at hand, we can di- 
rectly compute the expected Fisher information content 
on /nl from a single halo mass bin. If the dark mat- 
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FIG. 10. Halo model predictions for the one-sigma error on /nl (inferred from an effective volume of VcS — 50ft~^Gpc^, 
taking into account all modes with 0.0039/iMpc~^ < ^ < 0.032/iMpc~^ at 2; = 0) as a function of minimum halo mass from 
uniform- (solid lines) and weighted halos (dashed lines) from a single mass bin. The A'^-body simulation results are overplotted, 
respectively, as squares and circles for different low-mass cuts. Results that assume knowledge of halos and the dark matter are 
plotted in red (filled symbols), those that only consider halos are depicted in blue (open symbols). The dotted lines (triangles) 
show the results from splitting the halo catalog into multiple mass bins and taking into account the full halo covariance matrix 
in calculating J/nl/nl- The high-sampling limit for one mass bin (n — >■ 00, J/^l/nl ~ 2 ih' /h) ) is overplotted for the uniform- 
(thin solid line) and the weighted case (thin dashed line). Arrows show the effect of adding a log-normal scatter of crinM = 0.5 
to all halo masses, they are omitted in all cases where the scatter has negligible impact. 



ter density field is known we apply Eq. (HUl) . if it is 
not we use Eq. In order to obtain most conserva- 

tive results, we neglect terms featuring derivatives of the 
shot noise with respect to /nl- We then apply Eq. pi| 
with fcmin = 0.0039/iMpc"\ fc^ax = 0.032/iMpc^^ and 
V ~ 50/i^'^Gpc^ to compute the one-sigma error forecast 
for /nl- 

Results are shown in Fig. [10] for z — 0. When 
the dark matter density field is available (red lines and 
filled symbols), weighting the halos (red dashed lines 
and filled circles) is always superior to the conventional 
uniform case (red solid lines and filled squares), espe- 
cially when going to lower halo masses. In particular, 
(T/jjL substantially decreases with decreasing Mmin in the 
weighted case, while for uniform halos it shows a spike 
at Mmin — 1-4 X lO^^/i~^M0. This happens when 6g be- 
comes unity and the non- Gaussian correction to the halo 
bias in Eq. ^ vanishes, leaving no signature of /nl in 
the effective bias. Since in the weighted case 6g > 1 for 
all considered M^in, this spike does not appear, although 



we notice that the error on /nl begins to increase below 
M^in~ lO"/i-iM0. 

The simulation results are overplotted as symbols for 
a few values of Mmin, the agreement with the halo 
model predictions is remarkable. Note that the first two 
data-points at Afniin = 9.4 x lO^^/i'^M© and M^in = 
2.35 X IQ^'^h^^yiQ resulting from our high-resolution sim- 
ulation were scaled to the effective volume of our 12 low- 
resolution boxes. The simulations yield a minimum error 
of cr/„L ^ 0.8 at Mmin ^ IO^^/i-^Mq in the optimally 
weighted case with the dark matter available. This value 
is even lower than what is anticipated by the halo model 

(c^/nl - 1)- 

The results without the dark matter are shown as blue 
lines and open symbols. fJf^^ exhibits a minimum at 
Afmin — 10^'*/i~^Mq with cT/„L ~ 10 for both uniform 
and weighted halos. Thus, weighting the halos does 
not decrease the lowest possible error on /nl from the 
uniform expected. This suggests that only the 

highest-mass halos (clusters at 2; = 0) need to be consid- 
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FIG. 11. Same as Fig.flOlat 2 = 1. 



ered to optimally constrain /nl from a single-bin survey 
without observations of the dark matter. 

In the limit of n — >■ 00, ^/nl/nl ~^ 2{b'/b)^. Then, 
according to Eq. (O for high Mmin, b' — >■ bcu, and hence 
^/nl/nl ~^ becomes independent of Mj^in- The cor- 
responding CTj^L ill the limit n — > 00 is plotted in Fig. [TU] 
for both uniform- (thin blue solid line) and weighted ha- 
los (thin blue dashed line) and it indeed approaches a 
constant value at high My^in- It is about a factor of 2 
below the minimum in ct/nl without setting n — > 00. 

The results for redshift z = 1 are presented in Fig. [TTJ 
In comparison to Fig. (TU] one can observe that all the 
curves are shifted towards the lower left of the plot, i.e., 
the constraints on /nl improve with increasing redshift. 
This is mainly due to the increase of the Gaussian ef- 
fective bias 6g with z, as evident from the left panel 
of Fig. [9l For example, the location of the spikes in 
'^/nl (-^^min) requires 6g = 1- At z = 1 this condition 
is fulfilled at lower M,nin 5 x lO^^/i-^M©) than at 
z = 0, thus shifting the spikes to the left. Further, since 
the Fisher information from Eqs. (|30p and (|40)) increases 
with 6gi c/NLl-^min) decrcascs, especially at low Mmin- 

In the case of optimally weighted halos with knowledge 
of the dark matter, our simulations suggest (T/„l — 0-6 
when reaching My^in — 10^^h~^MQ at z = 1, in good 
agreement with the halo model. It even forecasts ct/^jl — 
0.2 when including halos down to Mj^in — lO^'^ft-^^M©. 



B. Multiple tracers 

The more general strategy for constraining /nl from 
a galaxy survey is to consider all auto- and cross- 
correlations between tracers of different mass, namely, 
the halo covariance matrix Ch. If the dark matter den- 
sity field is known, one can add the correlations with this 
field and determine C. The Fisher information on /nl is 
then given by Eq. ([^ and Eq. ([5^ , respectively. Again, 
the halo model can be applied to make predictions on the 
Fisher information content. In Appendix [Dl the analyti- 
cal expressions for a, j3 and 7 are derived for arbitrarily 
many mass bins and the continuous limit of infinite bins. 

The dotted lines in Fig.lTUlshow the halo model predic- 
tions at z = in this continuous limit of infinitely many 
mass bins. When the dark matter is available (red dotted 
line), (T/„L coincides with the results from the optimally 
weighted one-bin case (dashed red lines). This confirms 
our claim that with the dark matter density field at hand, 
modified mass weighting is the optimal choice for con- 
straining /nl and yields the maximal Fisher information 
content. Only below Mmin ~ 10"'^^/i~^Mo the optimally 
weighted halo field becomes slightly inferior to the case 
of infinite bins. 

From multiple bins of halos without the dark matter 
(blue dotted line) we observe a different behavior. While 
at high Mmin the error on /nl still matches the results 
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FIG. 12. Same as Figs. llOl andllllat higher redshifts, as indicated in the bottom right of each paneL Here, only the halo model 
predictions are shown. 



from one mass bin, either uniform (blue solid line) or 
weighted (blue dashed line), below Mmin ~ 10^'^h~^MQ 
it departs towards lower values and finally reaches the 
same continuous limit as in the case where the dark mat- 
ter is available at Mj^in ~ 10^°h~^MQ. Thus, galaxies 
in principle suffice to yield optimal constraints on /nl, 
however, one has to go to very low halo mass. 

Our simulation results for multiple bins (triangles in 
Fig. [TU]) support this conclusion. Although we can only 
consider a limited number of mass bins in the numerical 
analysis (we used = 30 for our 12 low-res boxes and 
iV = 10 for our high- res box), the continuous limit of the 
halo model can be approached closely. However, note 
that residuals of sampling variance in the numerical de- 
termination of S, as described in Sec. IIV Bl and shown in 
Fig- El can result in an overestimation of ^/nl/nl- This 
is especially the case when the number of mass bins N is 



high, resulting in a low halo number density per bin h. 
Hence, we chose TV such that the influence of sampling 
variance on our results is negligible, and yet clear im- 
provements compared to the single-tracer case are estab- 
lished. 

One concern in practical applications is scatter in the 
halo mass estimation. Although X-ray cluster-mass prox- 
ies show very tight correlations with halo mass with a 
log-normal scatter of ainM < 0.1 [zl,[i3l) optical mass- 
estimators are more likely to have ainM — 0.5 [8l|. We 
applied a log-normal mass scatter of CTinM = 0.5 to all of 
our halo masses and repeated the numerical analysis for 
all the cases (symbols) shown in Fig. [TUl The arrows in 
that flgure show the effect of adding the scatter, pointing 
to the new (higher) value of <Jfyj^. We find the effect of 
the applied mass scatter to be negligible in most of the 
considered cases (arrows omitted). Only in the case of 
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one weighted halo bin with knowledge of the dark mat- 
ter (red filled circles) we observe a moderate weakening 
in /NL-constraints, especially towards lower M„ii,i. This 
is expected, since we make most heavy use of the halo 
masses when applying modified mass weighting. Yet, the 
improvement compared to the uniform one-bin case re- 
mains substantial, so the method is still beneficial in the 
presence of mass scatter. 

At higher redshifts, we observe the same characteristics 
as in the single-tracer case: the ct/^l -curves are shifted 
towards the lower left of the plot in Fig. [TTJ due to the 
increase in the effective bias with z. Moreover, the impact 
of mass scatter on ct/nl becomes less severe at higher 
redshifts, as evident from the smaller arrows in Fig. [TT] 
as compared to Fig. [TUl High-redshift data are therefore 
more promising for constraining /nl- This is good news, 
since the relatively large effective volume assumed in the 
current analysis {VcG — 50/i~^Gpc^) can only be reached 
in practical applications when going to z ^ 1 or higher. 

On the other hand, the convergence of the constraints 
obtained with and without the dark matter is pushed 
to even lower halo masses at higher redshifts. This can 
be seen in Fig. 1121 where we show the halo model pre- 
dictions for even higher redshifts, going up to z = 5. 
With a mass threshold of M,„i„ = IO^^/i-^Mq, the opti- 
mal constraints on /nl from only halos start to saturate 
above z ~ 2, where ct/nl — 0-5. This is however not 
the case when the dark matter is available: the error 
on /nl decreases monotonically up to z = 5 reaching 
cr/„L — 0.06, although for practical purposes it will be 
difficult to achieve this limit. Yet, reaching ct/nl 1 at 
z = 1 and Mmin ^ lO^^/i^-'^M© with a survey volume of 
about 50/i~'^Gpc^ seems realistic. 



VI. CONCLUSIONS 

The aim of this work is to assess the amount of in- 
formation on primordial non-Gaussianity that can be ex- 
tracted from the two-point statistics of halo- and dark 
matter large-scale structure in light of shot noise sup- 
pression and sampling variance cancellation techniques 
that have been suggested in the literature. For this pur- 
pose we developed a theoretical framework for calculat- 
ing the Fisher information content on /nl that relies on 
minimal assumptions for the covariance matrix of halos 
in Fourier space. The main ingredients of this model 
are the effective bias and the shot noise matrix, both of 
which we measure from iV-body simulations and compare 
to analytic predictions. Our results can be summarized 
as follows: 

• On large scales the effective bias agrees well with 
linear theory predictions from the literature, while 
towards smaller scales, it shows deviations that can 
be explained by the local bias-expansion model. 



The shot noise matrix exhibits two nontrivial eigen- 
values A+ and A_, both of which show a consid- 
erable dependence on /nl- We further show that 
the eigenvector V+ is closely related to the second- 
order bias and that the corresponding eigenvalue 
A+ depends on the shot noise of the squared dark 
matter density field £52 , which itself depends on 
/nl weakly. This property can become important 
when constraining /nl from very high-mass halos 
(clusters). However, since the Fourier modes of £",52 
are highly correlated, it is questionable how much 
information on primordial non-Gaussianity can be 
gained from the /NL-dependence of the shot noise 
matrix. We demonstrate, though, that for the con- 
sidered values of /nl the assumption of a Gaussian 
form of the likelihood function is sufficient to de- 
termine the correct Fisher information. 

• With the help of A^-body simulations we demon- 
strate how the parameter /nl can be constrained 
and its error reduced relative to traditional meth- 
ods by applying optimal weighting- and multiple- 
tracer techniques to the halos. For our specific sim- 
ulation setup with A/niin ~ lO^^/i~^M0, we reach 
almost 1 order of magnitude improvements in /NL- 
constraints at z = 0, even if the dark matter density 
field is not available. The absolute constraints on 
/nl depend on the effective volume and the mini- 
mal halo mass that is resolved in the simulations, 
or observed in the data, and are expected to im- 
prove further when higher redshifts or lower-mass 
halos are considered. 

• We confirm the existence of a suppression factor 
(denoted g-factor in the literature) in the amplitude 
of the linear theory correction to the non-Gaussian 
halo bias. We argue that this only holds for ha- 
los generated with a friends-of-friends finding algo- 
rithm and depends on the specified linking length 
between halo particles. For a linking length of 20% 
of the mean interparticle distance, our simulations 
yield g ~ 0.8. For halos generated with a spherical 
overdensity finder, we demonstrate that the best- 
fit values of /nl measured from the simulations are 
fairly consistent with the input values, i.e., q ~ 1. 

• We calculate the Fisher information content from 
the two-point statistics of halos and dark matter 
in Fourier space, both analytically and numerically, 
and express the results in terms of an effective bias, 
a shot noise matrix and the dark matter power 
spectrum. In the case of a single mass bin and as- 
suming knowledge of the dark matter density field, 
the Fisher information is inversely proportional to 
the shot noise and, therefore, not bounded from 
above if the shot noise vanishes. However, when 
only the halo distribution is available, the Fisher 
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information remains finite even in the limit of zero 
sliot noise. In tliis case, the amount of information 
on /nl can only be increased by dividing the halos 
into multiple mass bins (multiple tracers). 

• Utilizing the halo model we calculate ct/nl and find 
a remarkable agreement with our simulation re- 
sults. We show that in the continuous limit of in- 
finite mass bins, optimal constraints on /nl can in 
principle be achieved even in the case where dark 
matter observations are not available. With an ef- 
fective survey volume of ~ 50h~^Gpc^ out to scales 
of fcmin — 0.004/iMpc~^ this means aj^^ ~ 1 when 
halos down to M^nin ~ 10^^h~^MQ are observed 
at z = 0. In comparison to this, a single-tracer 
method yields /NL-constraints that are weaker by 
about 1 order of magnitude. Further improvements 
are expected at higher rcdshifts and lower M^-m, 
potentially reaching the level of ct/j^l ^ O-l- 

• In realistic applications, additional sources of noise, 
such as a scatter in halo mass will have to be con- 
sidered. We test the impact of adding a log-normal 
scatter of cinA/ ~ 0.5 to our halo masses and find 
our results to be relatively unaffected. Assuming 
the dark matter to be available to correlate against 
halos is even more uncertain. Weak-lensing tomog- 
raphy can only measure the dark matter over a 
broad radial projection and more work is needed 
to see how far this approach can be pushed. More- 
over, one would also need to include weak-lensing 
ellipticity noise into the analysis, which we have not 
done here. 

We conclude that the shot noise suppression method 
(modified mass weighting) as presented in [TJ] when the 
dark matter density field is available, and the sampling 
variance cancellation technique (multiple tracers) as pro- 
posed in when it is not, have the potential to sig- 
nificantly improve the constraints on primordial non- 
Gaussianity from current and future large-scale struc- 
ture data. In [l^ it was found (their Fig. 15) that 



while the power spectrum analysis of a single tracer 
with Afniin ~ lO^*/i~^M0 (close to our optimal mass 
for a single tracer without the dark matter) predicts 
^/nl 10 for Vcff — 50h~^Gpc^ , in good agreement 
with our results, the bispectrum analysis improves this 
to CT/^L 5. Our results suggest that the multitracer 
analysis of the halo power spectrum can improve upon 
a single-tracer bispectrum analysis, potentially reaching 
significantly smaller errors on /nl- In principle the mul- 
titracer approach can also be applied to the halo bis- 
pectrum, but it is not clear how much one can benefit 
from it, since the dominant terms in the bispectrum do 
not feature any additional scale dependence that changes 
with tracer-mass. 

In this paper we only focused on primordial non- 
Gaussianity of the local type and the two-point correla- 
tion analysis. Yet, our techniques can be applied to some, 
but not all, other models of primordial non-Gaussianity, 
which have only recently been studied in simulations |82| - 
[87| . Theoretical calculations of the non-Gaussian halo 
bias generally suggest different degrees of scale depen- 
dence and amplitudes depending on the model [88l492l | . 
Our methods may help to test those various classes of 
primordial non-Gaussianity and thus provide a tool to 
probe the physics of the very early Universe. 
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Appendix A: FISHER INFORMATION ON PRIMORDIAL NON-GAUSSIANITY FROM THE 

COVARIANCE OF HALOS 



Plugging Eq. (P7)) into Eq. (|26p and using the cyclicity of the trace yields 



F 



/nl/n 



1 



b''C-^bb''^C-^b'P'^ + b^C-^b'b''C-^b'P^ + 2b'^C-^£'C-^b' P + -Tr( £'C-^£'C-^ 



Applying Eq. yields 



where 



Finally, we get 



WC-'bP = a--^^-^, 
l + a l + a 

' l + a l + a 

-a l + a 

av + Q;/?/i 
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iTr(5'C,i5'C,i 
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l + a (l + a) 



a = b^S^bP , 13 = b^£-^b'P , 7 = b''^ S'^b' P 



-6T (£-^)'bP. iy=-b^ (£-')' b'P 
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p = b^ {£-')'£{£-')' bP 
2 ^ ' 



(1 + a) (a7 + 2z/ - p) + (1 - a) /32 + (/i/2 - 2^) 
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(A4) 
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(l + a)" 

For a single mass bin we have a7 = /3^, av = /3/i, ap = p^, jp = and ar = p/2. In this case, Eq. (IA11[) becomes 



-P'/nl/nl = 2 



l + a 



bb'P + £'/2 
b^P + £ 



(A12) 
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Appendix B: FISHER INFORMATION ON PRIMORDIAL NON-GAUSSIANITY FROM THE 
COVARIANCE OF HALOS AND DARK MATTER 

Now we need to work out Eq. ([25]) by plugging in Eq. ([57| and ([55]) . Let us first note that 



9c !_/ -p b'^e-'p . 

a/NL ^ \b' ~/3b-£'£-'b bb'^e-'p + £'e-' ' ■ ^ ' 



This yields 



Ff^^f^^ = ixr [/32 + y^S-^b'P - py^S^^bP - b'''£-^£'£-^bP + b'b''^£-^P - I3bb''^£-^P 

-£'£-^bb''^£-^P + bb'''£-^bb'''£-^P^ + bb''^£-^£'£-^P + £'£-^bb''^£-^P + £'£-^£'£-^] = 

= ^{/3^ + j-l3^-iy + -f-/3^-L' + P^ + iy + iy + 2T)=-f + T. (B2) 

Appendix C: CANCELLATION OF DARK MATTER DENSITY AND COSMIC VARIANCE 

In the case where the dark matter density field is known, one can immediately see from the first term in the chi- 
square of Eq. ([55]) . that with the model — bS + e from Eq. (O, the underlying density field S is completely canceled 
(including its sampling variance) and the residual is 

(X^) - {e^£-'e) - Tr {£-\ee^)) - N , (CI) 

where N is the number of halo bins. If we only consider halos, the first term from Eq. (|23p reads 

X' - ^;C, ^ Sl£-'S,^ ^ m-'b)' , (C2) 
where we used Eq. in the second equality. Plugging in the model S\i = b5 + e, we get 

- b^£-'bS^ + 2b''£-^eS + e^£-^e - — ^ (b^£-^bd + b''£-^ef = 

^^1^2W£-^A\--^^5-re^£-^e-^{W£-^e)' . (C3) 



A large fraction of the first two terms in the last expression obviously cancel when a 3> 1. The quantity a, also 



denoted as signal-to-noise ratio in 14 1, monotonically increases with the number of halo bins TV. In |14| it was shown 
to reach 0(10^) in the continuous limit. Even higher values can be reached when the mass resolution of the simulation 
is increased Hence, in the limit a ^ 1 the residual of the chi-square in Eq. (|C3[) becomes 

P a a b'£ b 

the same as in Eq. (jCl[) with knowledge of the dark matter. Note that in the case of one single halo bin as in Eq. (12^ . 
a cancellation neither of the underlying dark matter field, nor of the sampling variance is possible. 

Appendix D: HALO MODEL PREDICTION FOR ALPHA, BETA AND GAMMA 

In the halo model the shot noise matrix is given by Eq. (|47l) . In order to invert £, we write £ = A. Mb^ with 
j\ = n~^\. — bM.^ and apply the Sherman-Morrison formula: 

+ (Dl) 
1 - b^A^M ^ ' 
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